Clean Kinematic Samples in Dwarf Spheroidals: an Algorithm for Evaluating Membership and Estimating Distribution Parameters When Contamination Is Present
نویسندگان
چکیده
We develop an algorithm for estimating parameters of a distribution sampled with contamination. We employ a statistical technique known as “expectation maximization” (EM). Given models for both member and contaminant populations, the EM algorithm iteratively evaluates the membership probability of each discrete data point, then uses those probabilities to update parameter estimates for member and contaminant distributions. The EM approach has wide applicability to the analysis of astronomical data. Here we tailor an EM algorithm to operate on spectroscopic samples obtained with the Michigan-MIKE Fiber System (MMFS) as part of our Magellan survey of stellar radial velocities in nearby dwarf spheroidal (dSph) galaxies. These samples, to be presented in a companion paper, contain discrete measurements of line-of-sight velocity, projected position, and pseudo-equivalent width of the Mg-triplet feature, for ∼ 1000 − 2500 stars per dSph, including some fraction of contamination by foreground Milky Way stars. The EM algorithm uses all of the available data to quantify dSph and contaminant distributions. For distributions (e.g., velocity and Mg-index of dSph stars) assumed to be Gaussian, the EM algorithm returns maximum-likelihood estimates of the mean and variance, as well as the probability that each star is a dSph member. These probabilities can serve as weights in subsequent analyses. Applied to our MMFS data, the EM algorithm identifies more than 5000 stars as probable dSph members. We test the performance of the EM algorithm on simulated data sets that represent a range of sample size, level of contamination, and amount of overlap between dSph and contaminant velocity distributions. The simulations establish that for samples ranging from large (N ∼ 3000, characteristic of the MMFS samples) to small (N ∼ 30, resembling new samples for extremely faint dSphs), the EM algorithm distinguishes members from contaminants and returns accurate parameter estimates much more reliably than conventional methods of contaminant removal (e.g., sigma clipping). Subject headings: galaxies: dwarf — galaxies: kinematics and dynamics — (galaxies:) Local Group — galaxies: individual (Carina, Fornax, Sculptor, Sextans) — techniques: radial velocities
منابع مشابه
An EM Algorithm for Estimating the Parameters of the Generalized Exponential Distribution under Unified Hybrid Censored Data
The unified hybrid censoring is a mixture of generalized Type-I and Type-II hybrid censoring schemes. This article presents the statistical inferences on Generalized Exponential Distribution parameters when the data are obtained from the unified hybrid censoring scheme. It is observed that the maximum likelihood estimators can not be derived in closed form. The EM algorithm for computing the ma...
متن کاملAn Adenosine Triphosphate Bioluminescence Method for Evaluating the Microbial Contamination of the Salad-Preparing Tables and Salad-Serving Dishes in Restaurants of Mashhad City, Iran
Introduction: Consumption of vegetable products is increasing commonly in the world because they are recognized as an important source of nutrients, vitamins, and fiber for humans. Salads are among the most widely used foods that are also known as the most contaminated foods in restaurants. This study was conducted to determine the microbial contamination of salad-preparing tables and salad-ser...
متن کاملAre " Dwarf " Ellipticals Genuine Ellipticals?
We review the systematic properties of “dwarf” elliptical (dE) galaxies, focussing on the relation between “normal” and “dwarf” ellipticals. In recent years, this relation has been described as “dichotomy” – based essentially on a discontinuity in central surface brightness. We show that, outside of 300 pc from the centre, the Sérsic profile parameters vary continuously from “normal” to “dwarf”...
متن کاملEstimating the Optimal Dosage of Sodium Valproate in Idiopathic Generalized Epilepsy with Adaptive Neuro-Fuzzy Inference System
Introduction: Epilepsy is a clinical syndrome in which seizures have a tendency to recur. Sodium valproate is the most effective drug in the treatment of all types of generalized seizures. Finding the optimal dosage (the lowest effective dose) of sodium valproate is a real challenge to all neurologists. In this study, a new approach based on Adaptive Neuro-Fuzzy Inference System (ANFIS) was pre...
متن کاملDevelopment of an evolutionary fuzzy expert system for estimating future behavior of stock price
The stock market has always been an attractive area for researchers since no method has been found yet to predict the stock price behavior precisely. Due to its high rate of uncertainty and volatility, it carries a higher risk than any other investment area, thus the stock price behavior is difficult to simulation. This paper presents a “data mining-based evolutionary fuzzy expert system” (DEFE...
متن کامل